Flexible vector-processing algorithms for numerically solving extreme-scale, linear and non-linear, predictive and prescriptive, problems in science and engineering, on parallel-processing super computers

ABSTRACT

A computer-implemented method for numerical solution of a geometric programming problem is described, including the computer-implemented steps of: reformulating the geometric programming problem as an equivalent generalized geometric programming optimization problem with only linear constraints, and solving the equivalent generalized geometric programming optimization problem by vector processing, including determining by computer-implemented numerical computation a solution for an unconstrained objective function whose independent vector variable is the generalized geometric programming conjugate dual of a primal decision vector variable of the geometric programming problem, and includes a variable linear combination of fixed vectors enabling the vector processing. Also described are computer-readable storage devices, computer program products, and computer systems for such numerical solution methodology.

CROSS-REFERENCE TO RELATED APPLICATIONS

This is a continuation-in-part application under 35 U.S.C. §120 of International Patent Application No. PCT/US2015/023734 filed on Mar. 31, 2015, which in turn claims the benefit of U.S. Provisional Patent Application No. 61/973,232 filed on Mar. 31, 2014 in the name of Elmor L. Peterson for “FLEXIBLE VECTOR-PROCESSING ALGORITHMS FOR NUMERICALLY SOLVING EXTREME-SCALE, LINEAR AND NON-LINEAR, PREDICTIVE AND PRESCRIPTIVE, PROBLEMS IN SCIENCE AND ENGINEERING, ON PARALLEL-PROCESSING SUPER COMPUTERS”. The disclosures of International Patent Application No. PCT/US2015/023734 and U.S. Provisional Patent Application No. 61/973,232 are hereby incorporated by reference herein in their respective entireties.

FIELD

The present disclosure relates to new vector parallel-processing algorithms for numerically solving extreme-scale scientific and engineering problems not currently solvable in real time by previous algorithms whose computational complexity has been too high due to their inability to exploit modern parallel-processing super computers. These new algorithms utilize recent discoveries about the underlying signature characteristics of generalized geometric programming, a field that has been evolving for 50 years with the assistance of the present inventor.

DESCRIPTION OF THE RELATED ART

In the 1960s, commercial electrical power distribution networks had sufficiently increased in scope and complexity to require electric utility companies to custom design and fabricate large-scale power-distribution transformers that would satisfy industrial and residential power demands.

At that time, new mathematical formulations were developed for generalized-polynomial objective and constraint functions to achieve transformer models that provided minimum combined fabrication and expected life-time operating costs. This development, involving the efforts of Clarence M. Zener, Richard J. Duffin, and Elmor L. Peterson, gave rise to the field of geometric programming, and a seminal text, “Geometric programming: theory and application,” R. J. Duffin, E. L. Peterson, and C. M. Zener, John Wiley & Sons, Inc., New York-London-Sydney, 1967.

Geometric programming was subsequently embraced by the engineering design and construction industry, with such early successes as the design and construction by Bechtel Corporation of the national water storage and distribution system of Algeria in 1979, in which geometric programming modeling of the system was attributed as achieving cost savings, in relation to prior conventional design approaches, on the order of $50 million (equal to approximately $150 million in current U.S. dollars).

At present, geometric programming modeling and techniques are widely used in a variety of fields to optimally design and operate systems and their components, and commercial computer software programs implementing geometric programming modeling are readily available from a variety of vendors, including the MOSEK optimization software commercially available from Mosek ApS (Copenhagen, Denmark) for solving general convex nonlinear problems, conic quadratic problems, and semidefinite problems involving positive semidefinite matrix variables, among others, and Mosek ApS's optimization toolbox for MATLAB. Examples of current use of geometric programming techniques include optimal design of electronic microprocessors and integrated circuitry.

As a discipline, geometric programming has enabled reformulation and subsequent transformation (via geometric programming “conjugate duality” techniques) of the widely used “Shannon Information Theory” as well as Boltzmann thermodynamics and statistical mechanics, in which entropy optimization is utilized to infer the least biased probability distribution underlying particular statistical information.

Despite its substantial success, the utility of existing geometric programming techniques has been limited to moderate- and large-scale problems involving up to tens of thousands of decision variables, and its implementation for solution of extreme-scale (sometimes referred to as exa-scale or multi-scale) problems involving millions and even billions of decision variables has not been practically achieved.

SUMMARY

The present disclosure relates to parallel processing computer systems and parallel processing computer-implemented methods for numerical solution by flexible vector-processing algorithms, affording capability for numerical solution of extreme-scale geometric programming problems.

The present disclosure reflects the discovery of hidden properties of linearity, convexity, and separability found to be present in all geometric programming problems, and the correlative development of qualitative and quantitative exploitations of such hidden properties, involving uncovering, reformulation, and transformation of such hidden properties in unexpected and surprisingly effective ways.

In one aspect, the present disclosure relates to a computer-implemented method for numerical solution of a geometric programming problem, comprising the computer-implemented steps of:

reformulating the geometric programming problem as an equivalent generalized geometric programming optimization problem with only linear constraints, and

solving the equivalent generalized geometric programming optimization problem by vector processing, comprising determining by computer-implemented numerical computation a solution for an unconstrained objective function whose independent vector variable is the generalized geometric programming conjugate dual of a primal decision vector variable of the geometric programming problem, and comprises a variable linear combination of fixed vectors enabling the vector processing.

In another aspect, the disclosure relates to a computer-readable storage device embodying a non-transitory program of machine-readable instructions executable by a digital processing apparatus to perform a method of numerical solution of a geometric programming problem, the method comprising:

reformulating the geometric programming problem as an equivalent generalized geometric programming optimization problem with only linear constraints, and

solving the equivalent generalized geometric programming optimization problem by vector processing, comprising determining by computer-implemented numerical computation a solution for an unconstrained objective function whose independent vector variable is the generalized geometric programming conjugate dual of a primal decision vector variable of the geometric programming problem, and comprises a variable linear combination of fixed vectors enabling the vector processing.

In a further aspect, the disclosure relates to a computer program product comprising a computer program embodied in a tangible, computer-readable medium, for enabling numerical solution of a geometric programming problem, the computer program comprising:

instructions for reformulating the geometric programming problem as an equivalent generalized geometric programming optimization problem with only linear constraints, and

instructions for solving the equivalent generalized geometric programming optimization problem by vector processing, comprising instructions for determining by computer-implemented numerical computation a solution for an unconstrained objective function whose independent vector variable is the generalized geometric programming conjugate dual of a primal decision vector variable of the geometric programming problem, and comprises a variable linear combination of fixed vectors enabling the vector processing.

A still further aspect of the disclosure relates to a computer system comprising a memory and parallel processors that are programmatically adapted by the above-described computer program product to generate the numerical solution of the geometric programming problem.

Yet another aspect of the disclosure relates to a computer-implemented method for generating a solution output on a graphical user interface or other output component of a programmed vector parallel-processing computer to a mixed integer linearly constrained convex programming (MILCCP) application, wherein the programmed vector parallel-processing computer is programmed to reduce the MILCCP application to an unconstrained convex programming application, and wherein said method comprises automatic operation of the programmed vector parallel-processing computer to reformulate the MILCCP application as an equivalent convex generalized geometric programming (GGP) problem whose corresponding conjugate dual problem is unconstrained, and automatic operation of the programmed computer to solve the GGP problem with an unconstrained convex programming algorithm, by vector parallel-processing.

Additional aspects of the disclosure relate to a programmed vector parallel-processing computer programmed to carry out such method, and computer-readable media and processors containing instructions for carrying out such method in a vector parallel-processing computer.

Other aspects, features and embodiments of the disclosure will be more fully apparent from the ensuing description and appended claims.

DETAILED DESCRIPTION

The present disclosure relates to parallel processing computer systems and computer-implemented parallel processing methods for solution of heretofore unsolvable extreme-scale geometric programming problems.

As used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be understood that the terms “comprises” and/or “comprising,” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.

As discussed hereinabove, the present disclosure is based on the discovery that hidden properties of linearity, convexity, and separability are ubiquitous in geometric programming problems, and the resulting capability to develop exploitations by uncovering, reformulating, and transforming these hidden properties to achieve numerical solutions and optimal design of extreme-scale systems and their components.

In one aspect, the present disclosure relates to a computer-implemented method for numerical solution of a geometric programming problem, comprising the computer-implemented steps of:

reformulating the geometric programming problem as an equivalent generalized geometric programming optimization problem with only linear constraints, and

solving the equivalent generalized geometric programming optimization problem by vector processing, comprising determining by computer-implemented numerical computation a solution for an unconstrained objective function whose independent vector variable is the generalized geometric programming conjugate dual of a primal decision vector variable of the geometric programming problem, and comprises a variable linear combination of fixed vectors enabling the vector processing.

In such method, the geometric programming problem, i.e., primal problem, may be of any suitable type, and may for example comprise at least one of design, modeling, and optimization of a system, e.g., an exa-scale system. In various embodiments, the geometric programming problem may comprise a stochastic linear programming optimization problem. Specific examples of geometric programming problems susceptible to solution by the methodology of the present disclosure include multi-scale climate modeling, modeling of controlled nuclear fusion, and the design, optimization, and modeling applications disclosed hereinafter. Specific geometric programming problems amenable to solution by the method of the present disclosure include problems such as economic equilibration, physical equilibration, profit maximization, and cost minimization.

The numerical solution methodology of the present disclosure is more fully described hereinafter, and may for example comprise convex optimization of a non-convex primal geometric programming problem. The methodology may comprise Rockafellar bi-function programming, fuzzy optimization techniques, and other geometric programming approaches. In various embodiments, the methodology is conducted to produce an optimal solution that is used to solve a deterministic primal problem of the geometric programming problem by a linear programming technique.

The computer-implemented steps of the method of the present disclosure may be conducted on a parallel processing computer, e.g., a massively parallel supercomputer.

The methodology of the present disclosure can be implemented using a suitable programmably configured parallel-processing super computer. The parallel-processing super computer may be of any suitable type that is constructed, arranged, and programmed to perform flexible vector-processing algorithms for numerically solving extreme-scale, linear and non-linear, predictive and prescriptive, problems in accordance with the present disclosure.

In various embodiments, the parallel-processing super computer may comprise a massively parallel vector processing computer of a type as described in U.S. Pat. No. 4,891,751 issued Jan. 2, 1990 to Duane B. Call et al. Such massively parallel vector computer comprises a set of vector processing nodes, each including a main processor and a main random access memory (RAM) connected to an internal bus, with each node also including a vector memory and a programmable vector processor connected to a local bus, and the local and internal busses communicating via a local interface port controlled by the main processor.

In other embodiments, the parallel-processing super computer may comprise an ultrascalable petaflop parallel supercomputer as described in U.S. Pat. No. 7,761,687 issued Jul. 20, 2010 to Matthias A. Blumrich, et al., in which the super computer includes node architectures based upon system-on-a-chip technology, where each processing node comprises a single application specific integrated circuit and nodes are interconnected by multiple independent networks that optimally maximize the throughput of packet communications between nodes with minimal latency. The multiple networks may include high-speed networks for parallel algorithm message passing, with the multiple independent networks being collaboratively or independently utilized according to the needs or phases of an algorithm for optimizing algorithm processing performance.

The disclosure contemplates as an additional aspect of the technology described herein, a computer-readable storage device embodying a non-transitory program of machine-readable instructions executable by a digital processing apparatus to perform a method of numerical solution of a geometric programming problem, wherein the method comprises:

reformulating the geometric programming problem as an equivalent generalized geometric programming optimization problem with only linear constraints, and

solving the equivalent generalized geometric programming optimization problem by vector processing, comprising determining by computer-implemented numerical computation a solution for an unconstrained objective function whose independent vector variable is the generalized geometric programming conjugate dual of a primal decision vector variable of the geometric programming problem, and comprises a variable linear combination of fixed vectors enabling the vector processing.

The computer-readable storage device may be of any suitable type, and may for example comprise a device selected from the group consisting of random access memory, magnetic data storage diskettes, CD discs, hard drives, RAID arrays, magnetic tape, electronic read-only memory, and optical storage devices.

In a further aspect, the disclosure relates to a computer program product comprising a computer program for enabling numerical solution of a geometric programming problem. Such computer program comprises:

instructions for reformulating the geometric programming problem as an equivalent generalized geometric programming optimization problem with only linear constraints, and

instructions for solving the equivalent generalized geometric programming optimization problem by vector processing, comprising instructions for determining by computer-implemented numerical computation a solution for an unconstrained objective function whose independent vector variable is the generalized geometric programming conjugate dual of a primal decision vector variable of the geometric programming problem, and comprises a variable linear combination of fixed vectors enabling the vector processing.

Another aspect of the disclosure relates to a computer system comprising a memory and parallel processors that are programmatically adapted by the computer program product of the present disclosure, as described above, to generate the numerical solution of the geometric programming problem. Such computer system thus may be adapted to generate the numerical solution of a geometric programming problem comprising at least one of design, modeling, and optimization of an exa-scale system, e.g., a stochastic linear programming optimization problem.

The systems and methodology of the present disclosure enable numerical solution of previously unsolvable extreme-scale geometric programming problems, by use of flexible vector-processing algorithms. These vector-processing algorithms derive from the conjugate duality of generalized geometric programming (GGP), each of whose optimization problems, including those with nonlinear constraints, is reformulated as an equivalent GGP optimization problem with only linear constraints, by highly effective non-trivial transformations.

For ease of description, these algorithms will initially be described in the relatively simple context of linearly constrained GGP. Since linear programming (LP) is the simplest form of this relatively simple GGP programming type, it will be readily apparent that the algorithms of the present disclosure are ideally suited to solving, in real time, many extreme-scale LP optimization problems that have heretofore resisted solution by previous LP algorithms as a result of their inherently high computational complexity.

The new GGP algorithms of the present disclosure have the potential to replace all currently employed numerical linear-algebraic algorithms for solving general extreme-scale linear systems, viz., systems containing only linear equations and/or linear inequalities. The solution methodologies of the present disclosure achieve a substantial advance in the art, since they require only the numerical computation of a critical solution for an unconstrained objective function whose independent vector variable (essentially a GGP conjugate dual of the given (primal) decision vector variable) is a variable linear combination of appropriate fixed vectors, thereby enabling elementary vector processing to be employed.

For the resulting LP and numerical linear-algebraic algorithms, the GGP conjugate dual and primal objective functions for each individual LP optimization problem or linear system problem can be chosen from an infinite number of candidate functions, in which the selection is made to further simplify the vector parallel processing and/or exploit any special system structure, e.g., matrix sparsity.

Such infinite flexibility renders the solution methodologies of the present disclosure substantially more general in character than previously developed iterative solution methodologies for solving large-scale linear systems, such as interior point methodologies that have been developed for linear programming, and conjugate-gradient methods for symmetric positive-definite systems of linear equations, which are limited in application to a very narrow class of linear equations with no inequalities. The flexibility of the solution methodologies of the present disclosure makes such methodologies suitable for robust numerical solution of previously unsolvable general stochastic LP optimization problems and linear systems problems with uncertainty.

The disclosure hereinafter provides a brief overview of fundamentals and basic theory of GGP, within the familiar context of linear algebra and elementary multi-variable differential calculus, by way of background to the subsequent discussion of the vector processing systems and methods of the present disclosure. The uncovering, reformulating, and transforming of the signature characteristics of linearity, convexity, and separability, as utilized in the systems and methods of the present disclosure, will also be illustrated using elementary differential calculus and introductory linear algebra.

GGP, although developed for general nonlinear optimization, is addressed below in connection with new solution methodologies for general linear systems, as providing (i) vector processing, not available in previously developed methodologies such as interior-point methodologies for linear programming, for solving linear systems containing only linear equations and/or linear inequalities, and (ii) complete closed-form solutions to uncertainty problems for general linear systems, including distribution and robust problems for stochastic LP, using vector processing.

Considering general linear systems, solution methodologies for equations of the following type:

Given A∈ℝ^(m×n) and b∈ℝ^(m), solve Aχ=b for χ∈ℝ^(n),

include approaches such as Gaussian pivoting and related methodologies, and the Hestenes-Lanczos-Stiefel conjugate-gradient method.

The Gaussian pivoting method with [A|b] entails no requirement as a constraint on A, and is known to provide exact solutions, if they exist, when using exact arithmetic, with cubic computational complexity. Nonetheless, any early termination, as is frequently needed for large-scale systems, generally results in no usable approximate solution.

Using the Hestenes-Lanczos-Stiefel conjugate-gradient method requires m to be equal to n, and requires A to be both symmetric and positive definite. This approach minimizes (½)χ^(T)Aχ−b^(T)χ for χ∈ℝ^(n) via conjugate-gradient iterations, an extremely efficient refinement of the steepest-descent technique that is valid for strictly convex quadratic minimization, but possibly no other type of optimization. The optimal solution of this minimization automatically satisfies the first-order optimality condition Aχ−b=0 and hence Aχ=b. The Hestenes-Lanczos-Stiefel conjugate-gradient method is known to provide an exact solution, which always exists and is unique, when using exact arithmetic, with at most n conjugate-gradient iterations. Any early termination necessitated by application to large-scale systems will frequently result in a usable approximate solution whose accuracy is easily determined at the end of each iteration. The Hestenes-Lanczos-Stiefel conjugate-gradient method can also exhibit super-linear convergence when using finite-precision arithmetic.
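By way of non-limiting illustration, the conjugate-gradient iteration described above can be sketched as follows. The function name `conjugate_gradient`, the tolerance, and the 2×2 test matrix are assumptions introduced only for this sketch and do not appear in the source.

```python
import numpy as np

def conjugate_gradient(A, b, tol=1e-10, max_iter=None):
    """Minimize (1/2) x^T A x - b^T x for a symmetric positive-definite A."""
    n = b.shape[0]
    max_iter = n if max_iter is None else max_iter
    x = np.zeros(n)
    r = b - A @ x              # residual, i.e. the negative gradient of the quadratic
    p = r.copy()               # the first search direction is steepest descent
    rs_old = r @ r
    for _ in range(max_iter):
        if np.sqrt(rs_old) <= tol:
            break              # the residual norm measures accuracy after each iteration
        Ap = A @ p
        alpha = rs_old / (p @ Ap)
        x = x + alpha * p
        r = r - alpha * Ap
        rs_new = r @ r
        p = r + (rs_new / rs_old) * p   # next direction, A-conjugate to the previous ones
        rs_old = rs_new
    return x, np.sqrt(rs_old)

# Illustrative symmetric positive-definite system (values are assumptions).
A = np.array([[4.0, 1.0], [1.0, 3.0]])
b = np.array([1.0, 2.0])
x, residual_norm = conjugate_gradient(A, b)
```

The returned residual norm is the quantity that makes an early-terminated iterate usable: it reports how far Aχ is from b at the point where the iteration stopped.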

In contrast to the foregoing techniques, the new GGP methodologies of the present disclosure involve no requirement on A. Since the most general geometric programming problem required to completely treat the general equality system Aχ=b needs to be defined in terms of general orthogonal complementary subspaces, reformulating the system Aχ=b in a homogeneous format as Aχ−b=0 suggests using the slightly modified augmented matrix [A|−b] along with its null space

X={x∈ℝ^(n+1) | [A|−b]x=0}

and its row space

Y={y∈ℝ^(n+1) | y^(T)=z^(T)[A|−b] for some z∈ℝ^(m)},

which are orthogonal complementary subspaces in ℝ^(n+1). Since each vector x in this null space X that has a non-zero (n+1)^(th) component x_(n+1) clearly provides a solution χ to Aχ−b=0 that is simply the first n components of x/x_(n+1), it seems appropriate for this null space X to be used in the definition of the relevant primal generalized geometric programming problem, while the corresponding row space Y is used in the definition of a corresponding dual generalized geometric programming problem, namely, the extremely general (optimization-like) equilibrium problem.
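By way of non-limiting illustration, the recovery of a solution χ from a null-space vector of the augmented matrix [A|−b] can be sketched as follows. The 2×3 system and the use of a singular value decomposition to obtain a basis of the null space are assumptions made only for this sketch; the source does not prescribe how the null space is computed.

```python
import numpy as np

# Hypothetical consistent system A chi = b (values chosen only for illustration).
A = np.array([[1.0, 2.0, 0.0],
              [0.0, 1.0, 1.0]])
b = np.array([3.0, 2.0])

M = np.hstack([A, -b[:, None]])        # augmented matrix [A | -b], shape (m, n+1)

# Basis of the null space X: right singular vectors beyond the numerical rank.
_, s, Vt = np.linalg.svd(M)
rank = int(np.sum(s > 1e-12))
null_basis = Vt[rank:].T               # columns span X

# Choose a basis vector whose (n+1)-th component is largest in magnitude.
last_components = null_basis[-1, :]
k = int(np.argmax(np.abs(last_components)))
x = null_basis[:, k]

chi = x[:-1] / x[-1]                   # first n components of x / x_(n+1)
```

Because the rows of [A|−b] span Y and the columns of `null_basis` span X, the product `M @ null_basis` vanishes, which is the orthogonal complementarity stated above.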

The following dual GGP problem Q is next considered: given an arbitrary real-valued function h:D with a nonempty domain D⊂ℝ^(n+1) (and with a value h(y) for each y∈D), compute all critical points, if any, for the resulting composite function f:F whose domain

F={z∈ℝ^(m) | z^(T)[A|−b]=y^(T) for some y∈D}

and whose value

f(z)=h(y) when y^(T)=z^(T)[A|−b] for z∈F,

wherein the key computation of y^(T) from z^(T) can be efficiently accelerated with the aid of vector parallel-processing.

The relevance of this dual GGP problem Q to providing solutions to the primal equality system Aχ−b=0 is an immediate consequence of the following observation. If problem Q is consistent, i.e., if its feasible solution set F is not empty because Y∩D is not empty, and if it has a critical solution z_(cr) whose image y_(cr) ^(T)=z_(cr) ^(T)[A|−b] is an interior point of D at which h is differentiable, then the chain rule implies that

[A|−b]∇h(y_(cr))=0

and hence that ∇h(y_(cr)) is in the null space of [A|−b]. If, in addition, ∇h(y_(cr)) has a non-zero (n+1)^(th) component (that is, if the (n+1)^(th) partial derivative h_(n+1)(y_(cr))≠0), then the first n components of ∇h(y_(cr))/h_(n+1)(y_(cr)) clearly provide a solution χ to Aχ−b=0.
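By way of non-limiting illustration, the chain-rule identity underlying this observation, namely that the gradient of f at z equals [A|−b]∇h(y) with y^(T)=z^(T)[A|−b], can be checked numerically as sketched below. The particular smooth function h (a sum of hyperbolic cosines), the random test data, and the finite-difference step are assumptions chosen only for this sketch.

```python
import numpy as np

rng = np.random.default_rng(0)
m, n = 3, 4
A = rng.standard_normal((m, n))
b = rng.standard_normal(m)
M = np.hstack([A, -b[:, None]])            # [A | -b], shape (m, n+1)

def h(y):
    # Illustrative smooth function on R^(n+1); not taken from the source.
    return np.sum(np.cosh(y))

def grad_h(y):
    return np.sinh(y)

def f(z):
    return h(M.T @ z)                       # y^T = z^T [A | -b]

z = rng.standard_normal(m)
analytic = M @ grad_h(M.T @ z)              # chain rule: grad f(z) = [A | -b] grad h(y)

# Central finite differences as an independent check of the analytic gradient.
eps = 1e-6
numeric = np.array([(f(z + eps * e) - f(z - eps * e)) / (2 * eps)
                    for e in np.eye(m)])
```

At any critical solution the left-hand side is zero, so the identity places ∇h(y_(cr)) in the null space of [A|−b], as stated above.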

The scope of this observation can be greatly enlarged by replacing the differentiability assumption about h:D at a critical solution y_(cr) with a generalized differentiability assumption about h:D at y_(cr) using the non-smooth analysis of Clarke, but such extra flexibility requires substantially more sophisticated analyses. Instead, a simpler approach is taken to ensure, without much loss of generality, the automatic satisfaction of the key secondary assumption, h_(n+1)(y_(cr))≠0, in this observation. This is done by restricting the general form of the function h:D in problem Q to the special partially separable form

D=D̂×ℝ with (y₁, . . . , y_(n))=ŷ∈D̂ and y_(n+1)∈ℝ,

where D̂⊂ℝ^(n) is the nonempty domain for some arbitrary real-valued function ĥ:D̂ for which

h(y)=ĥ(ŷ)+y_(n+1).

Here ĥ:D̂ denotes the contracted version of the expanded function h:D, and ŷ denotes the first n components of y. In particular, since h_(n+1)(y_(cr)) is then identically one, h_(n+1)(y_(cr))≠0 and the solution χ to Aχ−b=0 consisting of the first n components of ∇h(y_(cr))/h_(n+1)(y_(cr)) is now just ∇ĥ(ŷ_(cr)), where ŷ_(cr) is simply the first n components of y_(cr).

The following examples are provided.

Example 1

Choosing D̂=ℝ^(n) and ĥ(ŷ)=(½){Σ₁^(n) y_(j)²} in the contracted version ĥ:D̂ of h:D implies that ∇ĥ(ŷ)=(y₁, y₂, . . . , y_(n))^(T) is a solution χ to Aχ−b=0 when ŷ^(T)=z_(cr)^(T)A, where z_(cr) must be an optimal solution to the convex quadratic minimization problem

Minimize (½)(z^(T)A)(z^(T)A)^(T)−z^(T)b for z∈ℝ^(m)

or, equivalently,

Minimize (½)z^(T)(AA^(T))z−b^(T)z for z∈ℝ^(m).

The first-order optimality condition for this problem is the linear system (AA^(T))z=b, often called the normal equations of the second kind. This system is solvable via the conjugate-gradient algorithm when A has full row rank, but is likely best solved by other linear-algebraic methodology because any sparsity of A might be absent in AA^(T). The vector parallel-processing of the computation of y^(T) from z^(T) in this GGP Problem Q is enhanced by the completely separable form of this example function.
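By way of non-limiting illustration, Example 1 can be sketched numerically as follows for a small system with full row rank. The matrix values are assumptions, and a dense direct solve stands in for whatever solver would be used at scale.

```python
import numpy as np

# Hypothetical full-row-rank system with m = 2 equations and n = 3 unknowns.
A = np.array([[1.0, 2.0, 0.0],
              [0.0, 1.0, 1.0]])
b = np.array([3.0, 2.0])

G = A @ A.T                      # AA^T; may be dense even when A is sparse
z_cr = np.linalg.solve(G, b)     # critical point of (1/2) z^T (AA^T) z - b^T z

y_first_n = A.T @ z_cr           # first n components of y^T = z^T [A | -b]
chi = y_first_n                  # the gradient of (1/2) sum_j y_j^2 is y itself
```

The only operation that couples all components is the product of A^(T) with z, which is a linear combination of the fixed rows of A and therefore lends itself to elementary vector processing.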

Example 2

Choosing D to be the entire space and h(y) to be a constant, in the context of a system Aχ−b=0 known to have a non-zero solution χ, shows that solutions χ cannot always be produced by a solution y_(cr) of the dual GGP Problem Q (because ∇h(y_(cr))=0), unless h:D is carefully selected.

In particular, functions h:D that are guaranteed to produce at least one solution χ to every system Aχ−b=0 that has a solution are most easily described in terms of the conjugate transform g:C of the expanded version h:D of a given contracted function ĥ:D̂, wherein

C={x∈ℝ^(n+1) | sup_(y∈D) [x^(T)y−h(y)]<+∞}

and

g(x)=sup_(y∈D) [x^(T)y−h(y)] for x∈C.

Such functions g:C and h:D are called “conjugate functions” because h:D is also the conjugate transform of g:C when h:D is closed and convex, a non-obvious symmetry that seems plausible because of the symmetry of the conjugate inequality

x ^(T) y≦g(x)+h(y) for each x∈C and each y∈D,

which is an elementary consequence of the preceding definition of g:C in terms of h:D.

Now, a non-obvious fact derivable from GGP duality theory is that those closed convex functions h:D for which the origin 0∈(interior D), and that also have a conjugate transform g:C for which C=ℝ^(n)×{1}, are guaranteed to produce at least one solution χ to every system Aχ−b=0 that has a solution.

Example 3

Choosing, as in the previous example,

D=ℝ^(n+1) and h(y)=(½){Σ₁^(n) y_(i)²}+y_(n+1)

implies, by differential calculus and the complete separability of h:D, that

C=ℝ^(n)×{1} and g(x)=(½){Σ₁^(n) x_(i)²}+0

and hence that this GGP approach produces at least one solution χ to every system Aχ−b=0 that has a solution.
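By way of non-limiting illustration, the conjugate transform stated in Example 3 can be worked out term by term from the definition of g:C, using only the separability of h. The derivation below is a sketch written in LaTeX notation; it assumes the quadratic h of this example and introduces no further quantities.

```latex
\begin{aligned}
g(x) &= \sup_{y \in \mathbb{R}^{n+1}} \Big[ \sum_{i=1}^{n} \big( x_i y_i - \tfrac{1}{2} y_i^2 \big) + (x_{n+1} - 1)\, y_{n+1} \Big] \\
\sup_{y_i \in \mathbb{R}} \big( x_i y_i - \tfrac{1}{2} y_i^2 \big) &= \tfrac{1}{2} x_i^2 \quad \text{(attained at } y_i = x_i\text{)} \\
\sup_{y_{n+1} \in \mathbb{R}} (x_{n+1} - 1)\, y_{n+1} &= \begin{cases} 0, & x_{n+1} = 1 \\ +\infty, & x_{n+1} \neq 1 \end{cases} \\
\Rightarrow \quad C &= \mathbb{R}^{n} \times \{1\}, \qquad g(x) = \tfrac{1}{2} \sum_{i=1}^{n} x_i^2 .
\end{aligned}
```

The linear term in y_(n+1) is what pins the last coordinate of every point of C to one, so that each x in X∩C is already normalized and its first n components solve Aχ−b=0.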

Since there are infinitely many choices of closed convex functions h:D for which 0∈(int D), and for which C=ℝ^(n)×{1}, there are infinitely many GGP dual variational principles that can be used to solve a given system Aχ−b=0, all of which use vector parallel-processing of the computation of y^(T) from z^(T). Considering the GGP duality theory discussed above, it is noted that the conjugate inequality derived above clearly implies the duality inequality

0≦g(x)+h(y) for each x∈X∩C and each y∈Y∩D,

because X and Y are orthogonal complementary subspaces in ℝ^(n+1). This inequality expresses the main part of the weak duality theorem for the following pair of generalized geometric programming dual problems:

Primal GGP Problem P:

Compute both the primal infimum

φ=inf_(x∈X∩C) g(x)

and the primal optimal solution set

S*={x | x∈X∩C and g(x)=φ}.

Dual GGP Problem Q:

Compute both the dual infimum

ψ=inf_(y∈Y∩D) h(y)

and the dual optimal solution set

T*={y | y∈Y∩D and h(y)=ψ}.

The strong duality theorem for generalized geometric programming specialized to, and tailored for, the dual variational principles described herein is set out in the following Key Theorem.

Key Theorem

Given that 0∈(int D), Problem Q is always super consistent, but is bounded if, and only if, the system Aχ−b=0 has a solution χ. Given also that C=ℝ^(n)×{1}, the system Aχ−b=0 has a solution χ if, and only if, Problem P is consistent; in which case Problem Q has a non-empty optimal solution set T*, and there is a solution χ* to the system Aχ−b=0 that is a subgradient of h(y) at y=y* for some y* in T*, with this subgradient χ* simply being the gradient ∇h(y*) when h(y) is differentiable at that y=y*. Moreover, each such solution χ* minimizes g(χ) over the set of all solutions χ to the system Aχ−b=0.
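By way of non-limiting illustration, the final statement of the Key Theorem can be checked numerically for the quadratic choice of h in Example 3, for which g is one half of the squared Euclidean norm. The random test system, the seed, and the helper name `g` are assumptions made only for this sketch.

```python
import numpy as np

rng = np.random.default_rng(1)
m, n = 3, 6
A = rng.standard_normal((m, n))          # hypothetical full-row-rank system
b = rng.standard_normal(m)

z_star = np.linalg.solve(A @ A.T, b)     # dual optimal solution for the quadratic h
chi_star = A.T @ z_star                  # gradient of h at y*, a solution of A chi = b

def g(chi):
    # Conjugate of the quadratic h, restricted to the first n components.
    return 0.5 * chi @ chi

# Every other solution differs from chi_star by a vector in the null space of A.
_, _, Vt = np.linalg.svd(A)
null_A = Vt[m:].T                        # columns span {v : A v = 0}
others = [chi_star + null_A @ rng.standard_normal(n - m) for _ in range(5)]
```

Each element of `others` still satisfies Aχ=b but has a value of g no smaller than that of `chi_star`, because `chi_star` lies in the row space of A and is therefore orthogonal to every null-space perturbation.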

By virtue of the theory of dual polyhedral (finitely generated) cones and a much more general duality theory of generalized geometric programming, essentially the same dual variational principles are valid when the i^(th) equation a_(i)χ−b_(i)=0 in Aχ−b=0 is replaced in the definition of X by the inequality a_(i)χ−b_(i)≧0, provided that the i^(th) row [a_(i)|−b_(i)] in [A|−b] is multiplied by a non-negative scalar variable z_(i)≧0 when defining Y as a linear combination of the rows of [A|−b].

Consequently, since a general linear optimization problem can always be solved to within any given absolute accuracy ε≧0 simply by solving a linear system consisting of the linear primal and dual feasibility conditions along with the condition that the algebraic difference between the linear primal and dual objective functions does not exceed the given ε, this general duality methodology applies to all linear optimization problems, including, most importantly, extra-large scale problems.
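As a hedged illustration of the linear system referred to above, the following sketch checks the three conditions for the canonical pair "maximize c^(T)x subject to Ax≦b, x≧0" and "minimize b^(T)y subject to A^(T)y≧c, y≧0". The canonical form, the function name `is_eps_optimal`, and the numerical tolerance are assumptions made for the example; any constant term d cancels in the objective difference and is therefore omitted.

```python
def is_eps_optimal(A, b, c, x, y, eps, tol=1e-12):
    """True if (x, y) satisfies primal feasibility, dual feasibility,
    and a duality gap b^T y - c^T x that does not exceed eps."""
    m, n = len(A), len(A[0])
    primal_ok = all(v >= -tol for v in x) and all(
        sum(A[i][j] * x[j] for j in range(n)) <= b[i] + tol for i in range(m))
    dual_ok = all(v >= -tol for v in y) and all(
        sum(A[i][j] * y[i] for i in range(m)) >= c[j] - tol for j in range(n))
    gap = (sum(b[i] * y[i] for i in range(m))
           - sum(c[j] * x[j] for j in range(n)))
    return primal_ok and dual_ok and gap <= eps + tol

A = [[1.0, 1.0], [1.0, 3.0]]
b = [4.0, 6.0]
c = [3.0, 2.0]
exact = is_eps_optimal(A, b, c, x=[4.0, 0.0], y=[3.0, 0.0], eps=0.0)
loose = is_eps_optimal(A, b, c, x=[3.0, 1.0], y=[3.0, 0.0], eps=1.0)
tight = is_eps_optimal(A, b, c, x=[3.0, 1.0], y=[3.0, 0.0], eps=0.5)
```

Any pair (x, y) passing this check is a solution of the combined linear system, so finding such a pair is a linear feasibility problem of the kind addressed by the dual variational principles.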

Addressing now issues of uncertainty in solving systems of linear equations, for the system

Given A∈ℝ^(m×n) and b∈ℝ^(m), solve Aχ=b for χ∈ℝ^(n),

consider A to be arbitrary but fixed, while b is uncertain and hence stochastically confined to the sample space Ω=ℝ^(m). For this sample space, Σ is a sigma algebra of events E, with σ:Σ→[0, 1] its associated probability distribution function typically generated by integrating a given probability density function ρ:Ω→[0, 1] over each relevant E. An analyst responsible for providing a solution χ* to this system for a decision-maker should provide the following fundamental information necessary for a rational decision: (1) the probability that this system has a solution, since if this probability is too low, the decision maker may want to consider preliminary actions that change σ:Σ→[0, 1], e.g., by changing ρ:Ω→[0, 1]; and (2) given that this system has a solution, a certainty when the rank of A is m, the most probable solution(s) χ* and related statistics.

Regarding the probability that this system has a solution, it is noted that the consistency event

C={b | b=Aχ for some χ∈ℝ^(n)},

is simply the column space of A for which there are known algorithms that provide a set of basis vectors for C that are computationally conducive to integrating numerically, if not analytically, a given probability density ρ(b) over C to determine the associated consistency probability σ(C).

Addressing now the most probable solution, if σ(C)>0, simply maximize ρ(b) over the primal vector space

X={(χ,b)∈ℝ^(n+m) | [A|−I](χ,b)=0}

intersected with C, which is just X itself. The equivalent minimization of −ρ(b) over X is an elementary generalized geometric programming problem whose vector-processing solution via GGP duality involves linear combinations of the rows of [A|−I], rather than the rows of [A|−b] for a particular value of b. A resulting most probable solution χ* taken from any optimal solution (χ, b)* can be expected to be robust for normal Gaussian probability densities ρ(b), as well as for many other important density types.

Considering now distribution and robust problems in stochastic linear programming with essentially no loss of generality, an illustrative canonical linear optimization problem is set out below.

Primal Problem P(A, b, c, d):

maximize c ^(T) x+d subject to Ax≦b and x≧0,

wherein A in ℝ^(m×n) is arbitrary but fixed, while b in ℝ^(m), c in ℝ^(n), and d in ℝ are all stochastic with (b, c, d) in ℝ^(m+n+1) confined to the sample space Ω=ℝ^(m+n+1), for which Σ is a given sigma algebra of events E with σ:Σ→[0, 1] its associated probability distribution function. This associated probability distribution function is typically generated by integrating a given probability density function ρ:Ω→[0, 1] over each relevant E.

An analyst responsible for providing an optimal solution to this primal problem for a decision-maker should provide the following fundamental information necessary for a rational decision:

(1) the probability that Problem P(A, b, c, d) is consistent, since if this probability is too low, the decision maker may want to consider preliminary actions that change σ:Σ→[0, 1], e.g., by changing ρ:Ω→[0, 1]; (2) given that Problem P(A, b, c, d) is consistent, the conditional probability that Problem P(A, b, c, d) is bounded, e.g., as is likely to be the case for realistic profit-maximization problems; and (3) given that Problem P(A, b, c, d) is both consistent and bounded, the most probable optimal solution(s) x* and related statistical information.

The probability that Problem P(A, b, c, d) is consistent can be addressed by introducing a slack variable s∈ℝ^(m). The relevant primal consistency event

C_(P)={(b,c,d) | b=Ax+s for some x,s≧0; c∈ℝ^(n); d∈ℝ}

is a finitely generated cone for which there are known algorithms that express its vectors in computationally simple ways that are conducive to integrating numerically, if not analytically, given probability densities ρ(b, c, d) over C_(P) to determine the associated consistency probability σ(C_(P)).

The probability that Problem P(A, b, c, d) is both consistent and bounded then can be determined, since it is known from linear optimization duality theory that a consistent Problem P(A, b, c, d) is bounded if, and only if, its linear dual Dual Problem Q(A, b, c, d):

minimize b ^(T) y+d subject to A ^(T) y≧c and y≧0,

is consistent. A formula for the corresponding dual consistency event C_(Q) (analogous to the formula above for C_(P)) is utilized, with introduction of a dual slack vector variable t∈ℝ^(n) for Problem Q(A, b, c, d). The resulting formula for C_(Q) is combined with the formula for C_(P). The primal consistency and boundedness event

B={(b,c,d) | b=Ax+s for some x,s≧0; c=A^(T)y−t for some y,t≧0; d∈ℝ}

is another finitely generated cone for which there are known algorithms that express vectors in computationally simple ways that are conducive to integrating numerically, if not analytically, given probability densities ρ(b, c, d) over B to determine the associated boundedness probability σ(B).

In summary, the probability σ(E) of each of these events E, each a finitely generated cone and hence also a polyhedral cone, can be computed by integrating analytically or numerically an appropriate probability density ρ(b, c, d) over each such cone E. To make these integrations as simple as possible, one of the elementary representations for a given finitely generated cone E can be used. Each such representation consists of unique linear combinations of finitely many elementary basis vectors, for the unique largest subspace of E, plus unique non-negative linear combinations of finitely many extreme recession-direction vectors, for an appropriate finitely generated subcone of E that is pointed.

Regarding the most probable solution, if σ(B)>0, then maximize ρ(b, c, d) subject to the linear primal and dual feasibility conditions, as a generalized geometric programming problem whose vector-processing solution via GGP duality produces an optimal solution (b, c, d)* that then is used to solve the deterministic Primal Problem P(A, b*, c*, d*) by any linear programming method. The resulting most probable solution x* can be expected to be robust for normal Gaussian probability densities ρ(b, c, d), as well as for many other important density types.

The above-discussed methodology can be extended to and implemented in other linearly constrained applications resulting in vector parallel processing, including, for example: alternatives to the Dantzig-Wolfe Decomposition Principle; optimal control with linear dynamics and dynamic programming with linear transition equations; Kalman filters and filters of the same general type; and linear complementarity and mixed integer linear programming problems.

The above-discussed methodology can also be extended to general non-linearly constrained convex optimization problems, by the conjugate duality and associated parametric programming and post-optimality analyses for either generalized geometric programming, GGP (including the prototype algebraic GGP case, AGP) or Rockafellar bi-function programming. Additionally, the present disclosure contemplates corresponding approaches for fuzzy optimization methodologies.

The vector processing algorithmic methodology of the present disclosure represents a major advance in numerical solution of general linear and continuously non-linear equilibration and optimization problems, both deterministic and stochastic. By reformulating and transforming signature characteristics of geometric programming, vector parallel processing techniques are made applicable to general linear and continuously-nonlinear algebraic optimization, in a manner that reduces optimization problems to equivalent forms in which convexity, linearity, and separability are utilized in transformed objective and constraint functions having a simple two-term formula, f(x,y)=log [c exp (x)+d exp (y)], in which c and d are non-negative coefficients, and x and y are scalar components of a decision vector variable v constrained to lie in a sub-space V of a Euclidean space whose dimensions m and n increase with problem size. In this formula, f(x,y) is both convex and asymptotically linear, being the simple linear function x+log [c] when d=0.
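The two-term formula can be evaluated as in the following sketch, which is only an illustration: the function name `two_term` and the shifted (overflow-safe) evaluation are assumptions, not part of the disclosure. It shows the reduction to x+log c when d=0 and the asymptotically linear behavior for large arguments.

```python
import math

def two_term(x, y, c, d):
    """f(x, y) = log(c*exp(x) + d*exp(y)) for c, d >= 0 (not both zero),
    evaluated by factoring out the largest exponent to avoid overflow."""
    terms = []
    if c > 0:
        terms.append(math.log(c) + x)
    if d > 0:
        terms.append(math.log(d) + y)
    if not terms:
        raise ValueError("at least one coefficient must be positive")
    top = max(terms)
    return top + math.log(sum(math.exp(t - top) for t in terms))

linear_case = two_term(1.5, 7.0, 2.0, 0.0)      # d = 0: equals x + log c
large_case = two_term(800.0, 0.0, 1.0, 1.0)     # dominated by the x term
# Midpoint convexity check along a segment from p to q
p, q = (0.0, 1.0), (2.0, -1.0)
mid_value = two_term(1.0, 0.0, 1.0, 3.0)
avg_value = 0.5 * (two_term(*p, 1.0, 3.0) + two_term(*q, 1.0, 3.0))
```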

This function f(x,y) is essentially separable because the monotonicity of the log function justifies its replacement by the separable function [c exp(x)+d exp(y)] when separability is more desirable than near linearity. In this context, general linear optimization can be viewed as the extremely special case when d=0 for each function f(x,y). Software for general linear and non-linear applications of the present methodology may include a coded front-end module that accepts as input only a coefficient vector (with components consisting of all of the non-negative coefficients c and d) and an m×n generating or basis matrix (for the sub-space V), whose partitions are compatible with those that are determined by the objective and all of the constraint functions.

Since the Stone-Weierstrass Approximation Theorem and related regression methodology show how to transform all optimization problems with continuous objective and constraint functions (defined on compact domains) into equivalent algebraic optimization problems with only polynomial objective and constraint functions, the GP methodology of the present disclosure actually is much more general and useful when combined with appropriate regression methodology. The vector processing methodology of the present disclosure may be usefully applied to stochastic linear and non-linear optimization, with applicability to continuous optimization problems that have integer decision variables or other non-continuous types of combinatorial constraints.

The methodology of the present disclosure, relying on the hidden linearity and near linear convex separability in algebraic optimization problems, as uncovered, reformulated, and transformed for vector parallel processing and cloud computing, represents a recognition of the true nature of geometric programming, including original prototype GP, algebraic geometric programming (AGP), and its various generalizations, generalized geometric programming (GGP). The methodology is further illustrated in the following elementary problem examples. These examples are set out in increasing level of sophistication and abstraction, to show the increasingly subtle signature characteristics of both AGP and GGP.

Example 4

The optimal shipment of gravel across a river is desired to be determined. The gravel is shipped on a ferry boat that makes any number of trips, using a custom-designed cylindrical gravel container having a radius t₁ in feet (an independent design variable) and a height t₂ in feet (an independent design variable). The circular bottom and top members of the container cost $A/ft², and the cylindrical side member costs $B/ft², where A and B are given exogenous parameters. The capital cost therefore is A2πt₁ ²+B(2πt₁)t₂. V ft³ of gravel is to be transported, and the ferry costs $R/round-trip, where V and R are given exogenous parameters. The operating cost is R(V/πt₁ ²t₂).

The problem is to find the design vector variable t=(t₁, t₂)>0 that minimizes the total cost P(t)=c₁t₁ ²+c₂t₁t₂+c₃/t₁ ²t₂ for the given exogenously determined coefficients c₁=2πA, c₂=2πB, and c₃=(RV/π). Since this objective function P(t) is an algebraic function that can also be expressed as P(t)=c₁t₁ ²t₂ ⁰+c₂t₁ ¹t₂ ¹+c₃t₁ ⁻²t₂ ⁻¹ and hence can be represented more concisely by the

${{coefficient}\mspace{14mu} {vector}\mspace{14mu} c} = {\begin{pmatrix} c_{1} \\ c_{2} \\ c_{3} \end{pmatrix}\mspace{14mu} {and}}$ ${{{exponent}\mspace{14mu} {matrix}\mspace{14mu} A} = \begin{pmatrix} 2 & 0 \\ 1 & 1 \\ {- 2} & {- 1} \end{pmatrix}},$

it is an example of a generalized polynomial with general real exponents. Moreover, since c=(c₁, c₂, c₃)>0 and t=(t₁, t₂)>0, the objective function P(t)>0, even though some of its exponents a_(ij) are not positive, and thus P(t) is a positive polynomial.

At this juncture, the hidden linearity and nearly linear convex separability in P(t) are not yet visible and therefore are still to be uncovered.

One traditional approach for minimizing P(t) directly involves a direct search descent algorithm. This approach requires specific values for c=(c₁, c₂, c₃), and provides an optimal design t*=(t*₁, t*₂)>0 and an optimal value P*=P(t*)>0 that is valid for only a very limited range of the parameter values A, B, R, and V. Another, earlier traditional approach is to minimize P(t) indirectly by finding a solution t* to the optimality condition ∇P(t)=0, which is the non-linear algebraic system

2c ₁ t ₁ ¹ t ₂ ⁰+1c ₂ t ₁ ⁰ t ₂ ¹−2c ₃ t ₁ ⁻³ t ₂ ⁻¹=0

0c ₁ t ₁ ² t ₂ ⁻¹+1c ₂ t ₁ ¹ t ₂ ⁰−1c ₃ t ₁ ⁻² t ₂ ⁻²=0.

Such non-linear systems usually could not be solved analytically prior to the advent of geometric programming, termed prototype GP (PGP) or algebraic GP (AGP), and usually could be solved only numerically, e.g., by a Newton-Raphson algorithm requiring specific values for c=(c₁, c₂, c₃) and therefore having some of the same disadvantages as the direct approaches.

Traditional numerical approaches do not provide the complete functional dependence of the optimal solution t* and the optimal value P* on the parameter values A, B, R, V that is of interest to designers.

The geometric programming approach of the present disclosure replaces the difficult nonlinear optimality condition ∇P(t)=0 by an equivalent linear optimality condition whose solutions are easily obtainable by elementary linear algebra. This involves an initial step of restating P(t) as P(t)=T₁(t)+T₂(t)+T₃(t) to provide an explicit focus on its “terms,” T₁(t)=c₁t₁²t₂⁰, T₂(t)=c₂t₁¹t₂¹, and T₃(t)=c₃t₁⁻²t₂⁻¹. Next, since the derivative of t_(j)^(a_(ij)) for t_(j)>0 is simply a_(ij)t_(j)^(a_(ij)−1), multiplying both sides of the j^(th) component of the equation ∇P(t)=0 by t_(j)>0 for j=1, 2 provides the equivalent linear optimality conditions

2T ₁+1T ₂−2T ₃=0

0T ₁+1T ₂−1T ₃=0

whose coefficient matrix is simply the transpose A^(t) of the exponent matrix A, a result that holds for all generalized polynomials P(t), by virtue of elementary algebra and calculus. The linear way in which only the terms Ti of P appear in these reformulated optimality conditions suggests that the focus on finding the optimal solution (t*₁, t*₂)=t* should shift to finding the optimal terms (T*₁, T*₂, T*₃)=T*. However, since this linear optimality condition A^(t) T=0 is necessary but not sufficient in itself to determine T*, another optimality condition is needed. A key to finding an appropriate linear optimality condition is to use the defining equation P=T₁+T₂+T₃ and the fact that P>0 to infer that T₁/P+T₂/P+T₃/P=1. Then, the linear manner in which only the ratios T_(i)/P appear suggests that the focus on the terms T=(T₁, T₂, T₃) should further shift to the term ratios

y=(y ₁ ,y ₂ ,y ₃) where

y ₁ =T ₁ /P y ₂ =T ₂ /P y ₃ =T ₃ /P

which are simply the fractional parts y_(i) of the objective value P resulting from its terms T_(i) respectively. Dividing the equations in optimality condition A^(t)T=0 by P>0 shows that the optimal ratio vector (y*₁, y*₂, y*₃)=y*>0 satisfies the orthogonality conditions

2y ₁+1y ₂−2y ₃=0

0y ₁+1y ₂−1y ₃=0

as well as the normality condition

y ₁ +y ₂ +y ₃=1.

The linear system consisting of the orthogonality and normality conditions has a unique solution

y* ₁=⅕ y* ₂=⅖ y* ₃=⅖

showing that an optimally designed cylindrical container always has cylindrical side member costs that are twice the combined circular bottom member and top member costs while being identical to the operating costs, invariant relative to the coefficient vector c=(c₁, c₂, c₃) and hence to the exogenous parameters A, B, V, R.
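The orthogonality and normality conditions form a small square linear system, and the following sketch solves it exactly with rational arithmetic. The helper name `solve_linear` and the use of Gauss-Jordan elimination are illustrative choices, and the helper assumes a square nonsingular system (the zero degree-of-difficulty case).

```python
from fractions import Fraction

def solve_linear(M, rhs):
    """Gauss-Jordan elimination with exact rationals.
    Assumes M is square and nonsingular."""
    n = len(M)
    aug = [[Fraction(v) for v in row] + [Fraction(r)] for row, r in zip(M, rhs)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        aug[col] = [v / aug[col][col] for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [a - factor * p for a, p in zip(aug[r], aug[col])]
    return [row[-1] for row in aug]

exponent_matrix = [[2, 0], [1, 1], [-2, -1]]
# Two orthogonality rows (the transpose of the exponent matrix),
# followed by the normality row y1 + y2 + y3 = 1.
system = [[row[j] for row in exponent_matrix] for j in range(2)] + [[1, 1, 1]]
y_star = solve_linear(system, [0, 0, 1])
```

Because the system involves only the exponent matrix, the result does not depend on the coefficient vector c.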

Other important interpretations of the optimal-ratio vector y* will become transparent if its specific value (⅕, ⅖, ⅖) is not used while solving for the optimal value P* and optimal solution t* by the defining equations

y* ₁ =c ₁ t ₁ ² t ₂ ⁰ /P, y* ₂ =c ₂ t ₁ ¹ t ₂ ¹ /P, y* ₃ =c ₃ t ₁ ⁻² t ₂ ⁻¹ /P.

This nonlinear system with the unknowns P and t is a disguised version of an equivalent linear system in the corresponding unknowns log P and log t, and can be obtained simply by equating the logarithm of both sides of each equation, to produce the equivalent log-linear system

log P=log(c ₁ /y* ₁)+2 log t ₁+0 log t ₂

log P=log(c ₂ /y* ₂)+1 log t ₁+1 log t ₂

log P=log(c ₃ /y* ₃)−2 log t ₁−1 log t ₂

This system is readily solved by first solving for log P, by multiplying both sides of its three equations by y*₁, y*₂, and y*₃ respectively, and adding the results, to yield

(y* ₁ +y* ₂ +y* ₃)log P=

y* ₁ log(c ₁ /y* ₁)+y* ₂ log(c ₂ /y* ₂)+y* ₃ log(c ₃ /y* ₃)+

(2y* ₁+1y* ₂−2y* ₃)log t ₁+(0y* ₁+1y* ₂−1y* ₃)log t ₂

which then reduces to

log P=y* ₁ log(c ₁ /y* ₁)+y* ₂ log(c ₂ /y* ₂)+y* ₃ log(c ₃ /y* ₃)

by virtue of the normality condition and orthogonality conditions respectively. Exponentiating both sides of this equation and substituting (⅕, ⅖, ⅖) for y* yields the formula

P*=[5/(2^(4/5))](c ₁ ^(1/5))(c ₂ ^(2/5))(c ₃ ^(2/5))

wherein c ₁=2πA, c ₂=2πB, c ₃=(RV/π)

prior to obtaining a formula for t*. However, substituting the resulting formula for log P* back into the log-linear system from which it was derived produces an over-determined reduced log-linear system in the unknowns log t₁ and log t₂ whose unique solution produces, after exponentiation, a formula for t* in terms of c and hence A, B, R, V.
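A short numerical sketch of this closed-form solution follows. The function names and the sample parameter values are assumptions chosen for illustration; t* is recovered from the first two defining equations (T₁=y*₁P* and T₂=y*₂P*), which is one convenient way of solving the reduced log-linear system.

```python
import math

Y_STAR = (0.2, 0.4, 0.4)

def total_cost(t, c):
    """P(t) = c1*t1^2 + c2*t1*t2 + c3/(t1^2*t2)."""
    t1, t2 = t
    return c[0] * t1 ** 2 + c[1] * t1 * t2 + c[2] / (t1 ** 2 * t2)

def optimal_design(A_cost, B_cost, R, V):
    c = (2 * math.pi * A_cost, 2 * math.pi * B_cost, R * V / math.pi)
    # Entropic formula: P* = prod_i (c_i / y_i)^(y_i)
    P = math.prod((ci / yi) ** yi for ci, yi in zip(c, Y_STAR))
    t1 = math.sqrt(Y_STAR[0] * P / c[0])     # from c1*t1^2   = y1*P
    t2 = Y_STAR[1] * P / (c[1] * t1)         # from c2*t1*t2 = y2*P
    return P, (t1, t2), c

# Hypothetical parameter values, for illustration only
P_opt, t_opt, c = optimal_design(A_cost=3.0, B_cost=2.0, R=50.0, V=1000.0)
closed_form = 5.0 / 2 ** 0.8 * c[0] ** 0.2 * c[1] ** 0.4 * c[2] ** 0.4
```

Evaluating `total_cost(t_opt, c)` reproduces `P_opt`, and small perturbations of `t_opt` give a larger cost, which is a quick sanity check on the formulas.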

Post-optimality analysis is now addressed. Since the previously derived entropic formula

log P*=y* ₁ log(c ₁ /y* ₁)+y* ₂ log(c ₂ /y* ₂)+y* ₃ log(c ₃ /y* ₃)

implies, by virtue of the invariance of y* with respect to changes in c, that

∂(log P*)/∂(log c _(i))=y* _(i) for i=1,2,3,

y* provides a sensitivity analysis analogous to that provided by the dual optimal solutions y* in linear programming, whose duality is actually a non-obvious special case of algebraic geometric programming duality. This formula becomes directly meaningful when combined with the chain rule to show that

∂P*/∂c _(i) =[P*][y* _(i)][1/c _(i)] for i=1,2,3

which in turn implies by the multi-variable chain rule that for any exogenous parameter E, such as A, B, R, V,

∂P*/∂E=P*{[y* ₁ /c ₁](∂c ₁ /∂E)+[y* ₂ /c ₂](∂c ₂ /∂E)+[y* ₃ /c ₃](∂c ₃ /∂E)}
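The sensitivity formulas can be checked numerically, as in the following sketch. The function names and sample parameter values are illustrative assumptions. The derivative with respect to the exogenous parameter R uses the fact that only c₃=RV/π depends on R.

```python
import math

Y_STAR = (0.2, 0.4, 0.4)

def p_star(c):
    """Optimal value P* = prod_i (c_i / y_i)^(y_i)."""
    return math.prod((ci / yi) ** yi for ci, yi in zip(c, Y_STAR))

def dp_dc(c, i):
    """dP*/dc_i = P* * y_i / c_i."""
    return p_star(c) * Y_STAR[i] / c[i]

def coefficients(A_cost, B_cost, R, V):
    return (2 * math.pi * A_cost, 2 * math.pi * B_cost, R * V / math.pi)

def dp_dR(A_cost, B_cost, R, V):
    """Chain rule: only c3 depends on R, with dc3/dR = V/pi."""
    c = coefficients(A_cost, B_cost, R, V)
    return dp_dc(c, 2) * (V / math.pi)

def central_difference(c, i, rel_step=1e-6):
    """Finite-difference estimate of dP*/dc_i, used only as a check."""
    h = rel_step * c[i]
    up, down = list(c), list(c)
    up[i] += h
    down[i] -= h
    return (p_star(up) - p_star(down)) / (2 * h)

c = coefficients(3.0, 2.0, 50.0, 1000.0)   # hypothetical parameter values
```

For these formulas ∂P*/∂R simplifies to (⅖)P*/R, so a one percent increase in the ferry cost raises the optimal total cost by about 0.4 percent.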

In general, for positive polynomial minimization problems with a total number n of terms T_(i) and with at least one positive solution y⁺>0 to the orthogonality conditions A^(t)y=0, the degree of difficulty=n−(rank A)−1, which is simply the dimension of the solution set for the orthogonality and normality conditions. It is zero for the present example and can remain zero as the problem size, determined mainly by both n and the number m of independent variables t_(j), increases without bound. Although a complete analytical solution to this positive polynomial minimization example requires only elementary linear algebra because it has zero degree of difficulty, more difficult positive polynomial minimization problems with positive degrees of difficulty can be more easily analyzed and solved using problem reformulations and transformations analogous to those in this simplest zero degree-of-difficulty example.

Regarding problem reformulations, since log t_(j) has played a key role in the solution of this example, it is natural to reformulate this example in terms of new independent variables

z _(j)=log t _(j) for j=1,2

which define one-to-one mappings from (0, +∞) onto (−∞, +∞) with inverses

t _(j)=exp z _(j) for j=1,2

Substituting exp z_(j) for t_(j) into the defining equation for P(t) and then using the laws of exponents gives the equivalent objective function

F(z)=c ₁exp(2z ₁+0z ₂)+c ₂exp(1z ₁+1z ₂)+c ₃exp(−2z ₁−1z ₂)

which is to be minimized as z varies over all of ℝ^(m) (with m=2 in this example), giving rise to optimal solutions z* in ℝ^(m), if there are any in the general case. The nature of this formula for F(z) motivates the following linear transformation from z to another new independent vector variable

$x = \begin{pmatrix} x_{1} \\ x_{2} \\ x_{3} \end{pmatrix} = \begin{pmatrix} 2 & 0 \\ 1 & 1 \\ -2 & -1 \end{pmatrix} \begin{pmatrix} z_{1} \\ z_{2} \end{pmatrix} = Az$

which gives rise to another equivalent objective function

G(x)=c ₁exp(x ₁)+c ₂exp(x ₂)+c ₃exp(x ₃)

which is a strictly convex and separable exponential function that is to be minimized as x varies over all of the column space X of the exponent matrix A, giving rise to optimal solutions x*, if there are any in the general case.

Finally, since log P, and hence its equivalents log F and log G, plays a key role in the preceding solution of this example, and since the log function is monotone increasing, it is natural to reformulate this objective function G(x) as another equivalent objective function

g(x)=log [G(x)]=log [c ₁exp(x ₁)+c ₂exp(x ₂)+c ₃exp(x ₃)]

which is still strictly convex but is now asymptotically linear while being exponentially separable and still to be minimized as x varies over all of the column space X of the exponent matrix A, giving rise to the same optimal solutions x*, if there are any in the general case.
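The reformulated objective can also be minimized numerically, as a cross-check on the analytical solution. The sketch below applies plain fixed-step gradient descent to g(Az) over z; the step size, iteration count, and coefficient values are assumptions chosen for this small example and are not prescribed by the disclosure. The gradient is A^(t)y, where y is the vector of term ratios T_(i)/P, so the iteration stops moving exactly when the orthogonality conditions hold.

```python
import math

A = [(2.0, 0.0), (1.0, 1.0), (-2.0, -1.0)]   # exponent matrix of Example 4

def term_ratios(z, c):
    """Return (g, y): g = log sum_i c_i exp(a_i . z) and y_i = T_i / P."""
    u = [math.log(ci) + a[0] * z[0] + a[1] * z[1] for ci, a in zip(c, A)]
    top = max(u)
    w = [math.exp(ui - top) for ui in u]
    total = sum(w)
    return top + math.log(total), [wi / total for wi in w]

def minimize_g(c, step=0.1, iters=5000):
    z = [0.0, 0.0]
    for _ in range(iters):
        _, y = term_ratios(z, c)
        # gradient of g(Az) with respect to z is A^t y
        grad = [sum(y[i] * A[i][j] for i in range(3)) for j in range(2)]
        z = [z[j] - step * grad[j] for j in range(2)]
    g, y = term_ratios(z, c)
    return z, g, y

c = (2 * math.pi, 2 * math.pi, 1 / math.pi)   # A = B = R = V = 1, illustrative
z_opt, g_opt, y_opt = minimize_g(c)
closed_form = 5.0 / 2 ** 0.8 * c[0] ** 0.2 * c[1] ** 0.4 * c[2] ** 0.4
```

At convergence exp(g*) agrees with the closed-form P*, and the computed ratios approach (⅕, ⅖, ⅖).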

The following theoretical observations and facts are noted: (i) an optimal value P* exists if, and only if, at least one of the other optimal values F*, G*, and g* exists, in which case all of these optimal values exist, and P*=F*=G*=exp g*; (ii) an optimal solution t* exists if, and only if, an optimal solution z* exists, in which case all of the optimal solutions t* are in one-to-one correspondence with all of the optimal solutions z*, with the corresponding optimal components satisfying the equations t*_(j)=exp z*_(j) for j=1, 2; (iii) an optimal solution z* exists if, and only if, an optimal solution x* exists, in which case each optimal solution z* is a solution to the linear system Az=x* for some optimal x*, and each solution z to such a system Az=x* is optimal; (iv) if an optimal solution x* exists, it is unique because of the strict convexity of g(x), but since the set Z* of all optimal solutions z* is simply the solution set for the resulting unique linear system Az=x*, the optimal solution set Z* is a singleton if, and only if, the exponent matrix A has full column rank, with a corresponding statement being valid for the set T* of all optimal t*; and (v) an optimal solution x* exists if, and only if, there is a positive dual vector y⁺>0 that is in the orthogonal complement X^(⊥) of X, namely, the solution set Y for the orthogonality conditions.

Concerning computational observations and facts, it is noted that the analytical or numerical minimization of either G(x) or g(x) over the vector space X, the column space of the exponent matrix A, is generally preferred over either the minimization of F(z) over ℝ^(m) or the minimization of P(t) over all t>0. This is due to the fact that X has infinitely many different linear-algebraic representations that provide vector parallel processing as a consequence of the separability of both G and g, whose “convexities” also provide easily computed Legendre-Fenchel conjugate transforms H and h, respectively. Such conjugate transforms play key roles in the construction of geometric programming dual problems that generalize linear programming dual problems in very non-trivial ways, and are at least as useful.

In various aspects, the disclosure relates to computational systems, computer-implemented methods, and computer-readable media for conducting mixed integer linearly constrained convex programming (MILCCP), including applications of such programming, e.g., Markowitz portfolio selection and other applications, in which MILCCP techniques are reduced to unconstrained convex programming.

Such aspects of the disclosure relate to an approach in which a “relaxation” (without the integer constraints) is easily reformulated in a revolutionarily new way as an equivalent convex generalized geometric programming (GGP) problem whose corresponding conjugate dual problem is unconstrained and hence can be solved with any unconstrained convex programming algorithm, after which a “cutting plane” resulting from an easily-computed primal-optimal solution readily produces a “dual-update” that is also unconstrained, and hence solvable without (relatively inefficient) “branching”, “bounding”, and “cold re-starting”. During each such iteration, vector parallel-processing can be used directly, without pre-processing, to solve extra-large-scale or multi-scale problems previously unsolvable in real time. Some related stochastic MILCCP problems can also be solved by this GGP methodology.

Linearly constrained convex programming (LCCP) is illustrated for a canonical LCCP problem with relaxed (and hence no) integer constraints, by Problem P below.

Problem P: Minimize f(x) subject to

Ax=b and l≦x≦u,

where l∈ℝ^(n), u∈ℝ^(n), A∈ℝ^(m×n), b∈ℝ^(m), and x∈ℝ^(n),

while f is convex on its convex domain F⊂ℝ^(n), with F typically being either ℝ^(n) or restricted by finitely many (desirably 0) inequalities r(x)≦0 on convex functions r:R, where R⊂ℝ^(n).

Note: P is in "standard form" when l=0 and u=+∞. P can model any LP problem when f(x)=c^(T)x+d and F=ℝ^(n) or F={x∈ℝ^(n) | l≦x≦u}.

The key reformulation is set out below.

Expand x=(x₁, x₂, . . . , x_(n))∈ℝ^(n) into x=(x₁, x₂, . . . , x_(n), x_(n+1))∈ℝ^(n+1).

This permits a reformulation of f(x) as the "equivalent"

f(x₁, x₂, . . . , x_(n))+0x_(n+1) with domain F×{1}⊂ℝ^(n+1),

which provides the extremely important reformulation of Ax=b as

[A|−b]x=0 (because x_(n+1)∈{1}).

Now, defining

X={x∈ℝ^(n+1) | [A|−b]x=0}, a vector sub-space of ℝ^(n+1),

completes the key reformulation of Problem P as the following generalized geometric programming (GGP) problem.

GGP Problem P: Minimize f(x₁, . . . , x_(n))+0x_(n+1) subject to

(x₁, . . . , x_(n), x_(n+1))∈X∩(F×{1}).

Next, orthogonal complement and conjugate transform operations are performed. To get the (highly desired) UCP form for the general example P of LCCP, the GGP conjugate dual Q of P is constructed—from the orthogonal complement Y of X and the conjugate transform h:H of f(x₁, . . . , x_(n))+0x_(n+1):F×{1}. In particular,

Y={y∈ℝ^(n+1) | y=z^(T)[A|−b] for some z∈ℝ^(m)}

(namely the "row space" of [A|−b]) and

H={y∈ℝ^(n+1) | sup_(x∈F×{1}) [y^(T)x−f(x₁, . . . , x_(n))−0x_(n+1)]<+∞},

with

h(y)=sup_(x∈F×{1}) [y^(T)x−f(x₁, . . . , x_(n))−0x_(n+1)].

The GGP conjugate dual Q of P then is addressed.

Problem Q: Minimize h(y) subject to

y∈Y∩H

where

Y={y∈ℝ^(n+1) | y=z^(T)[A|−b] for some z∈ℝ^(m)}

(simply the "row space" of [A|−b]),

h(y)=sup_(x∈F×{1}) [y^(T)x−f(x₁, . . . , x_(n))−0x_(n+1)]

=sup_(x∈F)[Σ₁^(n) y_(j)x_(j)−f(x₁, . . . , x_(n))]+y_(n+1) when (y₁, . . . , y_(n)) is in the domain D of the conjugate transform of f:F, and

H={y∈ℝ^(n+1) | (y₁, . . . , y_(n))∈D and y_(n+1)∈ℝ},

with H=ℝ^(n+1) when D=ℝ^(n) → vector-processing.

A key corollary of the foregoing is therefore derivable. Given that 0∈(int H), Problem Q is super consistent; but Q is bounded if, and only if, Problem P is consistent—in which case there is no duality gap between P and Q (in that their minima sum to 0). If P is, in fact, consistent, then Q has a non-empty optimal solution set T*; and there is an optimal solution x* to P that is a “sub gradient” of h(y) at a y* in T*—with this x* actually being the gradient ∇h(y*) when h is differentiable at y*.

As a key observation, if each component u_(j) of u is initially chosen to be at least as large as an estimated optimal value x*_(j) of x_(j) (for j=1, 2, . . . , n), then D=ℝ^(n) and hence 0∈(int H) → vector-processing.

Summarizing the foregoing methodology, the relaxed problem P (without the integer constraints) is solved by solving its GGP dual Q.

1. If the integer constraints are satisfied by a gradient (or sub-gradient) x* of h(y*) for a y* in T*, the original MILCCP problem has been solved by x*.

2. If the integer constraints are not satisfied by x*, then compute a "cutting plane" and use its data to attach an additional row_(m+1) to [A|−b] while also attaching an additional component z_(m+1) to z.

3. Use a "hot start" at (z*₁, z*₂, . . . , z*_(m), 0) to begin solving the new GGP dual Q₂, and repeat as necessary to obtain a solution.

The linear programming case is summarized below.

Choose F={x∈ℝ^(n) | l≦x≦u} and f(x)=c^(T)x+d.

Then, H=ℝ^(n+1) and

h(y)=[Σ₁^(n) max{l_(j)(y_(j)−c_(j)), u_(j)(y_(j)−c_(j))}]+y_(n+1)−d.

The resulting Problem Q has as its objective function this convex piece-wise linear (with 2n pieces) function h(y) that can, of course, be reformulated as a purely linear function in a usual way (by introducing n additional decision variables). Since H=ℝ^(n+1) and hence 0∈(int H), vector-processing can be exploited.
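The piece-wise linear conjugate for the linear programming case can be checked against its definition, as in the following sketch. The function names and the small integer data are illustrative assumptions. The brute-force version evaluates the supremum over F×{1} at the corners of the box, which suffices because the expression inside the supremum is linear in x.

```python
from itertools import product

def h_lp(y, c, d, l, u):
    """Closed form: sum_j max{l_j (y_j - c_j), u_j (y_j - c_j)} + y_{n+1} - d."""
    n = len(c)
    return sum(max(l[j] * (y[j] - c[j]), u[j] * (y[j] - c[j]))
               for j in range(n)) + y[n] - d

def h_brute_force(y, c, d, l, u):
    """sup over x in F x {1} of y^T x - (c^T x + d), taken over box corners."""
    n = len(c)
    best = None
    for corner in product(*zip(l, u)):
        value = (sum(y[j] * corner[j] for j in range(n)) + y[n] * 1
                 - sum(c[j] * corner[j] for j in range(n)) - d)
        best = value if best is None else max(best, value)
    return best

c, d = [3, -1], 2
l, u = [0, -1], [4, 5]
y = [1, 2, 7]
closed = h_lp(y, c, d, l, u)
brute = h_brute_force(y, c, d, l, u)
```

Each summand depends on a single component y_(j), which is the separability that allows the dual objective to be evaluated component by component.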

In a specific aspect, the disclosure contemplates a methodology of reducing dynamic convex programming with linear transition equations (DCPwLTE) to unconstrained convex programming with vector processing (UCPwVP).

In this methodology, the resulting UCPwVP algorithm for DCPwLTE consists mainly of iteratively computing linear combinations of the row vectors in a fixed partitioned matrix that describes the linear transition equations. Each such partition actually consists of component row vectors that are all zero—except for the n partitions that are in a one-to-one correspondence with the n “dynamic stages” of the given DCPwLTE problem. Consequently, this algorithm can be greatly accelerated by the “vector-processing” of rows that contain nearly all zeros—greatly reducing the size of Richard Bellman's “curse of dimensionality” for DCPwLTE. Moreover, if the original DCPwLTE objective function is separable into a sum of functions that are in a one-to-one correspondence with the n dynamic stages, then its corresponding UCPwVP objective function (a generalized geometric programming conjugate-dual function) is also separable—and hence can be computed even more efficiently with single parallel-processing of the results of the iterative vector-processing.

The reduction is illustratively described below.

Given a DCPwLTE system that is initially in (vector) “state” s¹, a (vector) decision d¹ causes a system transition from s¹ into the state

s ² =A ₁ s ¹ −B ₁ d ¹

where the matrices A₁ and B₁ are given system matrices. After (n−1) such transitions

s ^(k) =A _(k-1) s ^(k-1) −B _(k-1) d ^(k-1) for k=2,3,4, . . . ,n

the resulting column vector variable x=[s¹ d¹ s² d² s³ d³ . . . s^(n−2) d^(n−2) s^(n−1) d^(n−1) s^(n)]^(T) clearly satisfies the linear homogeneous system

Ax = 0, where

$A = \begin{pmatrix} A_{1} & -B_{1} & -I_{1} & 0 & 0 & 0 & \cdots & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & A_{2} & -B_{2} & -I_{2} & 0 & \cdots & 0 & 0 & 0 & 0 & 0 \\ \vdots & & & & & & \ddots & & & & & \vdots \\ 0 & 0 & 0 & 0 & 0 & 0 & \cdots & A_{n-2} & -B_{n-2} & -I_{n-2} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & \cdots & 0 & 0 & A_{n-1} & -B_{n-1} & -I_{n-1} \end{pmatrix}$

If x is also required to be in the domain G of a given closed and convex objective function g:G that is to be minimized subject to Ax=0, the most general DCPwLTE problem that can be considered here is presented as the following generalized geometric programming (GGP) problem.

Primal Problem P: Minimize g(x) for x∈X∩G where the vector space X={x|Ax=0}.

The GGP dual problem is defined in terms of both the orthogonal complement Y of X and the conjugate transform h:H of g:G using the row vector variable y=[t¹ e¹ t² e² t³ e³ . . . t^(n−2) e^(n−2) t^(n−1) e^(n−1) t^(n)].

Dual Problem Q: Minimize h(y) for y∈Y∩H where the vector space Y={y|y=zA for some z}, namely, the row space of A.

The calculation of zA then can exploit vector processors.
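By way of non-limiting illustration, the product zA can be formed stage by stage without assembling A. Writing z=[z¹ z² . . . z^(n−1)] in blocks that match the block rows of A (a partition assumed here for illustration), the block columns of A give t¹=z¹A₁, t^(k)=z^(k)A_(k)−z^(k−1) for k=2, . . . , n−1, e^(k)=−z^(k)B_(k), and t^(n)=−z^(n−1). The following Python sketch uses hypothetical helper names and hand-picked integer data; it compares the stage-wise result with the dense product and checks that y is orthogonal to a feasible x.

```python
def vecmat(z, M):
    """Row vector times matrix: a linear combination of the rows of M."""
    return [sum(z[i] * M[i][j] for i in range(len(M))) for j in range(len(M[0]))]

def dual_vector(A_blocks, B_blocks, z_blocks):
    """Return y = zA as [t1 e1 ... t(n-1) e(n-1) tn] without forming A."""
    y = []
    for k in range(len(A_blocks)):
        t = vecmat(z_blocks[k], A_blocks[k])               # z^k A_k
        if k > 0:                                          # -I block of the previous row
            t = [a - b for a, b in zip(t, z_blocks[k - 1])]
        e = [-v for v in vecmat(z_blocks[k], B_blocks[k])] # -z^k B_k
        y += t + e
    return y + [-v for v in z_blocks[-1]]                  # t^n = -z^(n-1)

def dense_A(A_blocks, B_blocks):
    """Dense reference copy of A, used only to check the stage-wise result."""
    stages, p, q = len(A_blocks), len(A_blocks[0]), len(B_blocks[0][0])
    rows = []
    for k in range(stages):
        base = k * (p + q)
        for i in range(p):
            row = [0] * (stages * (p + q) + p)
            row[base:base + p] = A_blocks[k][i]
            row[base + p:base + p + q] = [-b for b in B_blocks[k][i]]
            row[base + p + q + i] = -1
            rows.append(row)
    return rows

# Hypothetical instance: n = 3 stages, 2 state and 1 decision component.
A_blocks = [[[1, 2], [0, 1]], [[2, 0], [1, 1]]]
B_blocks = [[[1], [0]], [[0], [2]]]
z_blocks = [[1, -1], [2, 3]]

y_stagewise = dual_vector(A_blocks, B_blocks, z_blocks)
z_flat = [v for block in z_blocks for v in block]
y_dense = vecmat(z_flat, dense_A(A_blocks, B_blocks))

# Feasible x worked out by hand from the transition equations with
# s1 = [1, 0], d1 = [1], d2 = [1]:  s2 = [0, 0],  s3 = [0, -2].
x = [1, 0, 1, 0, 0, 1, 0, -2]
dot = sum(a * b for a, b in zip(y_stagewise, x))

print(y_stagewise)   # [1, 1, -1, 6, 4, -6, -2, -3]
print(y_dense == y_stagewise, dot)
```

In this sketch each stage touches only its own A_k, B_k and two neighboring blocks of z, so the work per stage is independent of n; the zero inner product of y with x reflects that Y is the orthogonal complement of X.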

References relevant to the foregoing include Elmor L. Peterson, “Symmetric Duality for Generalized Unconstrained Geometric Programming”, SIAM Jour. Applied Math., 19, 487 (1970) and Elmor L. Peterson, “Geometric Programming”, SIAM Rev., 19, 1 (1976), reprinted in Advances in Geometric Programming (M. Avriel, Ed.), Plenum Press, 31 (1980), and John Joseph Dinkel, “A Duality Theory for Dynamic Programming Problems via Geometric Programming”—A dissertation (under the guidance of Elmor L. Peterson) submitted to the Northwestern University Graduate School in partial fulfillment of the requirements for a PhD degree in Applied Mathematics, June (1971), the disclosures of which are hereby incorporated herein by reference, in their respective entireties.

The present disclosure thus relates in another aspect to a computer-implemented method for generating a solution output on a graphical user interface or other output component of a programmed vector parallel-processing computer to a mixed integer linearly constrained convex programming (MILCCP) application, wherein the programmed vector parallel-processing computer is programmed to reduce the MILCCP application to an unconstrained convex programming application, and wherein said method comprises automatic operation of the programmed vector parallel-processing computer to reformulate the MILCCP application as an equivalent convex generalized geometric programming (GGP) problem whose corresponding conjugate dual problem is unconstrained, and automatic operation of the programmed computer to solve the GGP problem with an unconstrained convex programming algorithm, by vector parallel-processing.

The output component of the programmed vector parallel-processing computer in such method may comprise a storage component, storage medium, cloud storage capability, or other output retention structure, medium, or capability of appropriate character.

Additional aspects of the disclosure relate to a programmed vector parallel-processing computer programmed to carry out such method, and computer-readable media and processors containing instructions for carrying out such method in a vector parallel-processing computer.

Such method and associated programmed vector parallel-processing computer may for example be directed to Markowitz portfolio selection applications, or other MILCCP applications.

More generally, the vector processing algorithmic methodology described herein can be applied to a wide variety of system modeling and optimization problems, including linear programming as well as nonlinear programming problems, exa-scale generalized geometric programming problems, etc. Illustrative problems include, without limitation: multi-scale climate modeling including land, water, and atmospheric climatological components; modeling of controlled hydrogen fusion; modeling of physiological processes; modeling of nanoscale electronic devices and microelectromechanical systems (MEMS); rational drug design; design of materials such as high temperature superconductors, quantum computing components, structural nanomaterials, etc.; modeling of planetary, interstellar, and galactic systems; design and optimization of particle accelerators; modeling of population dynamics; design of pandemic intervention models; optimization of global supply chains; design of multinational disaster response systems; modeling of seismological activity and events; design and optimization of robotic systems; optimization of resource allocation in resource-constrained environments; political forecasting; modeling combustion energy systems; cosmology applications of modeling dark energy and dark matter interactions and environments; modeling exaJoule-level energy flows; design and optimization of exaFLOP computing systems; modeling of nanoscale sensor monitoring systems, e.g., for applications such as environmental assessment, telecommunications, and health management; as well as real-time solution of numerous other large-scale and multi-scale problems of economic and physical equilibration, profit maximization, and cost minimization that have been previously unsolvable.

While the disclosure has been set forth herein in reference to specific aspects, features and illustrative embodiments, it will be appreciated that the utility of the disclosure is not thus limited, but rather extends to and encompasses numerous other variations, modifications and alternative embodiments, as will suggest themselves to those of ordinary skill in the field of the present disclosure, based on the description herein. Correspondingly, the disclosure as hereinafter claimed is intended to be broadly construed and interpreted, as including all such variations, modifications and alternative embodiments, within its spirit and scope. 

What is claimed is:
 1. A computer-implemented method for numerical solution of a geometric programming problem, comprising the computer-implemented steps of: reformulating the geometric programming problem as an equivalent generalized geometric programming optimization problem with only linear constraints, and solving the equivalent generalized geometric programming optimization problem by vector processing, comprising determining by computer-implemented numerical computation a solution for an unconstrained objective function whose independent vector variable is the generalized geometric programming conjugate dual of a primal decision vector variable of the geometric programming problem, and comprises a variable linear combination of fixed vectors enabling the vector processing.
 2. The method of claim 1, wherein the geometric programming problem comprises at least one of design, modeling, and optimization of a system.
 3. The method of claim 2, wherein the system comprises an exa-scale system.
 4. The method of claim 1, wherein the geometric programming problem comprises a stochastic linear programming optimization problem.
 5. The method of claim 1, wherein the geometric programming problem comprises multi-scale climate modeling.
 6. The method of claim 1, wherein the geometric programming problem comprises modeling of controlled nuclear fusion.
 7. The method of claim 1, wherein the geometric programming problem comprises a problem selected from the group consisting of: economic equilibration, physical equilibration, profit maximization, and cost minimization.
 8. The method of claim 1, wherein the geometric programming problem comprises a problem selected from the group consisting of: modeling of physiological processes; modeling of nanoscale electronic devices and microelectromechanical systems (MEMS); pharmaceutical drug design; design of materials selected from the group consisting of high temperature superconductors, quantum computing components, and structural nanomaterials; modeling of planetary, interstellar, and galactic systems; design of particle accelerators; modeling of population dynamics; design of pandemic intervention models; optimization of global supply chains; design of multinational disaster response systems; modeling of seismological activity and events; design and optimization of robotic systems; optimization of resource allocation in resource-constrained environments; political forecasting; modeling of combustion energy systems; modeling of dark energy and dark matter interactions and environments; modeling of exaJoule-level energy flows; design and optimization of exaFLOP computing systems; and modeling of nanoscale sensor monitoring systems.
 9. The method of claim 1, wherein said solving comprises convex optimization of a non-convex geometric programming problem.
 10. The method of claim 1, comprising Rockafellar bi-function programming.
 11. The method of claim 1, comprising fuzzy optimization.
 12. The method of claim 1, wherein the vector processing produces an optimal solution that is used to solve a deterministic primal problem of the geometric programming problem by a linear programming method.
 13. The method of claim 1, wherein said computer-implemented steps are conducted on a parallel processing computer.
 14. A computer-readable storage device embodying a non-transitory program of machine-readable instructions executable by a digital processing apparatus to perform a method of numerical solution of a geometric programming problem, the method comprising: reformulating the geometric programming problem as an equivalent generalized geometric programming optimization problem with only linear constraints, and solving the equivalent generalized geometric programming optimization problem by vector processing, comprising determining by computer-implemented numerical computation a solution for an unconstrained objective function whose independent vector variable is the generalized geometric programming conjugate dual of a primal decision vector variable of the geometric programming problem, and comprises a variable linear combination of fixed vectors enabling the vector processing.
 15. The computer-readable storage device of claim 14, selected from the group consisting of random access memory, magnetic data storage diskettes, CD discs, hard drives, RAID arrays, magnetic tape, electronic read-only memory, and optical storage devices.
 16. A computer program product comprising a computer program for enabling numerical solution of a geometric programming problem, the computer program comprising: instructions for reformulating the geometric programming problem as an equivalent generalized geometric programming optimization problem with only linear constraints, and instructions for solving the equivalent generalized geometric programming optimization problem by vector processing, comprising instructions for determining by computer-implemented numerical computation a solution for an unconstrained objective function whose independent vector variable is the generalized geometric programming conjugate dual of a primal decision vector variable of the geometric programming problem, and comprises a variable linear combination of fixed vectors enabling the vector processing.
 17. A computer system comprising a memory and parallel processors that are programmatically adapted by the computer program product of claim 16 to generate the numerical solution of the geometric programming problem.
 18. The computer system of claim 17, wherein the geometric programming problem comprises at least one of design, modeling, and optimization of an exa-scale system.
 19. The computer system of claim 18, wherein the geometric programming problem comprises a stochastic linear programming optimization problem.
 20. A computer-implemented method for generating a solution output on a graphical user interface or other output component of a programmed vector parallel-processing computer to a mixed integer linearly constrained convex programming (MILCCP) application, wherein the programmed vector parallel-processing computer is programmed to reduce the MILCCP application to an unconstrained convex programming application, and wherein said method comprises automatic operation of the programmed vector parallel-processing computer to reformulate the MILCCP application as an equivalent convex generalized geometric programming (GGP) problem whose corresponding conjugate dual problem is unconstrained, and automatic operation of the programmed computer to solve the GGP problem with an unconstrained convex programming algorithm, by vector parallel-processing.
 21. The computer-implemented method of claim 20, wherein the MILCCP application comprises Markowitz portfolio selection. 